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Abstract — We give an approximation-preserving reduction 
from vector scheduling problem (VS) to generalized load bal- 
ancing problem (GLB). The reduction bridges existing results of 
the two problems. Specifically, the hardness result for VS holds 
also for GLB and any algorithm for GLB can be used to solve VS. 
Based on this, we get two new results. First, GLB does not have 
constant approximation algorithms that run in polynomial time 
unless P = NP. Second, there is an online algorithm (vectors 
coming in an online fashion) that solves VS with approximation 
bound e log(mrf), where e is the natural number, m is the number 
of partitions and d is the dimension of vectors. The algorithm 
is borrowed from GLB literature and is very simple in that 
each vector only needs to minimize the Li n (md) norm of the 
resulting load. However, it is unclear whether this algorithm runs 
in polynomial time, due to the irrational and non-integer nature 
of ln(md). We address this issue by rounding ln(md) to the next 
integer. We prove that the resulting algorithm runs in polynomial 
time with approximation bound elog(md) + ln Y^/)+i which is 
in 0(ln(md)). This improves the 0(ln 2 d) bound of the existing 
polynomial time algorithm for VS. 

I. Introduction 

Scheduling with costs is a very well studied problem in 
combinatorial optimization. The traditional paradigm assumes 
single-cost scenario: each job incurs a single cost to the 
machine that it is assigned to. The load of a machine is the 
total cost incurred by the jobs it serves. The objective is to 
minimize the makespan, the maximum machine load. Vector 
scheduling and generalized load balancing extend the scenario 
in different directions. 

Vector scheduling assumes that each job incurs a vector 
cost to the machine that it is assigned to. The load of a 
machine is defined as the maximum cost among all dimen- 
sions. The objective is to minimize the makespan. Vector 
scheduling is a multi-dimensional generalization of the tra- 
ditional paradigm. It finds application in multi-dimensional 
resource scheduling in parallel query optimization [fj, where 
it is not suitable to represent different requests (such as 
CPU, memory, network resource, etc.) as a single scalar. 
To solve vector scheduling, there are three approximation 
solutions |1|. Two of them are deterministic algorithms based 
on derandomization of a randomized algorithm, with one 
providing 0(ln 2 d) approximatiorj^] where d is the dimension 

The work was done when the first author was visiting the College of 
William and Mary. 

'in this paper, e denotes the natural number, ln(-) denotes the natural 
logarithm, and log(-) denotes the logarithm base 2. 



of vectors, and the other providing 0(\nd) approximation 
with running time polynomial in n d , where n is the number 
of vectors. The third algorithm is a randomized algorithm, 
which assigns each vector to a uniformly and randomly chosen 
partition. It gives O (In dm / In In dm) approximation with high 
probability, where m is the number of partitions (servers). For 
fixed d, there exists a polynomial time approximation scheme 
(PTAS) ID. A PTAS has also been proposed for a wide class 
of cost functions (rather than max) 0. 

Generalized load balancing is recently introduced to model 
the effect of wireless interference [3][4|. Each job incurs costs 
to all machines, no matter which machine it is assigned to. 
The exact cost incurred by a job to a specific machine is 
dependent on which machine the job is assigned to. The load 
of a machine is the total cost incurred by all the jobs, instead 
of just the jobs it serves. This model is well suited for wireless 
transmission, since, in wireless network, a user may influence 
all APs in its transmission range due to the broadcast nature 
of wireless signal. To solve the generalized load balancing 
problem, the current solution is an online algorithm, adapted 
from the recent progress in online scheduling on traditional 
model |5|. The solution, though provides good approximation, 
is rather simple: each job selects the machine to minimize the 
L T norm of the resulting loads at all machines where t is a 
parameter. 

To avoid confusion, we keep the two terms job and machine 
unchanged for generalized load balancing, while refer to job 
and machine in the vector scheduling model as vector and 
partition respectively. 

We make two contributions. First, we present an approach 
to encode any vector scheduling instance by an instance of 
generalized load balancing problem. This encoding method 
brings the recent progress in generalized load balancing into 
the vector scheduling domain, and takes the hardness result 
of vector scheduling to generalized load balancing problem. 
Second, the existing solution to GLB needs to compute the 
L\ n i norm (I is the number of machines), but it is unclear 
whether this norm can be computed in polynomial time. We 
eliminate this uncertainty by rounding In I to the next integer, 
guaranteeing polynomial running time. In addition, we prove 
that the approximation loss due to rounding is small. 

We conclude this section by the following two definitions. 



A. Vector Scheduling 

We are given positive integers n, d, m. There are a set 
V of n rational and d-dimensional vectors p\ , p2, . . . , p n 
from [0,oo) d . Denote vector pi = (pn, . . . ,Pid). We need 
to partition the vectors in V into m sets Ai,...,A m . The 
problem is to find a partition to minimize maxi<j< m ||A||oo 
where A^ — J^jeA - Pj * s me sum °f me vect ors in Aj, and 
1 1 A; | joe is the infinity norm defined as the maximum element 
in the vector Aj. For the case m > n, there is a trivial optimal 
solution that assigns vectors to distinct partitions. Therefore, 
we only consider the case m < n. 

For ease of presentation, we give an equivalent integer 
program formulation. Let be the indicator variable such 
ij = 1 if and only if vector is assigned to partition 
n 

d * — ' 
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Aj. Then 



A, 
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Kk<c 



The vector scheduling problem can be rewritten as 



minmax ^ 



subject to 



1, Vi 



(VS) 



J 

Xij £ {0, 1}, Vi, j 

B. Generalized Load Balancing 

This formulation first appears in ||4). We reformulate it with 
slightly different notations. There are a set M. of independent 
machines, and a set J of jobs. If job i is assigned to machine 
j, there is non-negative cost cyjt to machine k. The load 
of a machine is defined as its total cost. The problem is to 
find an assignment (or schedule) to minimize the makespan, 
the maximum load of all the machines. This problem can be 
formally defined as follows. 

mm max XijCijk 



subject to 



(GLB) 



= 1, Vi € J 

3 

Xij e {0,1}, VieJJeM 

where x e {0, l}!- 7 !*!-^ is the assignment matrix with 
elements x^ = 1 if and only if job i is assigned to machine 
j. The two constraints require each job to be assigned to one 
machine. 

II. Encoding Vector Scheduling by Generalized 
Load Balancing 

We first create a GLB instance for any VS instance, then 
prove their equivalence. At last, we discuss the hardness of 
GLB and extend the VS model. 



A. Creating GLB Instances 

Comparing VS to GLB, we can find that they mainly differ 
in the subscripts of max and J^. Our construction is inspired 
by this observation. 

Given as input to VS the vector set V and m partitions, we 
construct the GLB instance as follows. We set the jobs J = V. 
For each partition Aj and its fc-th dimension, we construct a 
machine, denoted by the pair (j, k). Thus, the constructed ma- 
chine set Ai is {(j, k) | j = 1, 2, . . . , m and k = 1,2, ... ,d}. 
We refer to a machine as a pair of indices so that we can 
map the machine back to its corresponding partition and 
dimension easily. For a machine t = (j, k) where t € A4, 
we refer to the partition j as t\, and the dimension k as 
t%, i.e., t = (tijta)- We can see that there are totally d 
machines (t included) corresponding to the same partition as 
the machine t. We denote [t] as the set of these machines, i.e., 
[t] = {(j, 1), U, 2), ■ ■ • , ij, d)}, where j = t x . Among these d 
machines, we select the first one (j, 1) as the anchor machine, 
denoted by t, such that a vector chooses partition Aj in VS if 
and only if the corresponding job chooses t in the new GLB 
problem. 

The incurred cost Ci St of job i to machine t if i chooses 
machine s is defined as 



Cist 



Pzt 2 
OO 





if s = t 

if s € [t] A s ^ t 



(1) 
(2) 
(3) 



if s <£ [t] 

where <[TJ([2]> are for the situation where s and t correspond to 
the same partition. They force a job to select only the anchor 
machines. ([3]) is for the situation where s and t correspond to 
different partitions. In this case, there is no load increase. 
The resulting GLB instance is defined as VS-GLB: 



mm max 

x' t 



subject to 



Cist 



(VS-GLB) 



S 

x' is e{o,i}, VieJ,seM 

To avoid the confusion with the general GLB problem, we 
intentionally use different notations x', s and t. The notation 
i is kept since it is in 1-1 correspondence with the vectors in 
VS. 

As an example, consider the case when d = 1. All vectors 
in VS have only one element, and there is only one machine 
in VS-GLB representing a partition in VS. The objective 
of VS becomes maxjJ2i x ijPn- On the other hand, the 
objective of VS-GLB is max t ^ x' it c itt = maxj £\ x' it Pix. 
Since any machine t corresponds to a distinct partition Aj, 
simply changing subscripts shows that the two problems are 
equivalent. For the case when d > 1, the proof is much 
involved, which we delay to Section |II-B| 

Theorem 1. The construction of VS-GLB can be done in 
polynomial time. 



Proof: An instance of VS needs VL(nd) bits. The con- 
structed VS-GLB instance has n jobs, md machines and 
n(md) 2 costs. Since to < n, all three terms are polynomials 
in n and d. The theorem follows immediately. ■ 
The following theorem shows that the constructed VS-GLB 
problem is equivalent to its corresponding VS problem. Let T 
be a positive constant. 

Theorem 2. There is a feasible solution x to VS with objective 
value T if and only if there is a feasible solution x' to VS-GLB 
with the same objective value T. 

This theorem shows that VS and its corresponding VS- 
GLB have the same optimal value. In addition, any c- 
approximation solution to VS-GLB, after transformation, is 
also a c-approximation solution to VS, vice versa. We prove 
this theorem in Section IH-BI 

It is worth mentioning that VS-GLB is a special instance of 
GLB. Since VS-GLB is converted from VS, VS is a special 
instance of GLB, which implies that VS should have approx- 
imation algorithms at least as good as GLB. Unfortunately, 
on the contrary, the literature shows better approximation 
algorithm for GLB than that for VS. Hence, it is worth 
applying algorithms of GLB to VS. 

B. Proof of Equivalence 

We first study the properties of feasible solutions to VS- 
GLB in Lemma 1 and Lemma 2, and then prove Theorem [2] 

Lemma 1. Given a feasible solution x' to VS-GLB yielding 
objective value T, for any i £ J , we have 

1) Vs ^ s,x' is = 0; 

2) 3j such that for s — (j, l),x' is = 1. 

Proof: For 1), suppose x' is = 1 for some s with 
s ^ s. Then x' is Ci s j — oo > T, contradicting with 
max t J2i, s x i S c lst = T. 

For 2), since J2 S x '%s = 1> there exists some s such that 
x' is = 1. Due to 1), we must have s = s, ■ 

Lemma 2. Given a machine t, a job i, and a feasible 
solution x' to VS-GLB yielding objective value T, we have 

X/s x is c ist — x i jPit 2 - 

Proof: Recall that [t] = {(t 1: 1), 2), . . . , (tx, <£)}. We 



have 



^ t x is c ist — ^ ] 

«e[t] 

= E 

sG[t] 



x is c ist + ^ x is c ist 



x is c ist 



(4) 

(5) 
(6) 



where Q is due to (j3j, |5]) is due to Lemma [T] and |6| is due 
to <[]}. ■ 

With the two lemmas, we can now prove the equivalence. 
Proof of Theorem ^ Gj ven a feasible solution x 

to VS, construct a feasible solution x' to VS-GLB as follows. 



Set x' a = Xi Sl and all others to be 0. We first show that x' 
is a feasible solution to VS-GLB. Obviously, x' is an integer 
assignment. We will check that J2 S x 'i S = !■ Observe that 
x' is = if s s. We only need to consider m machines 
(1, 1), (2, 1), . . . , (to, 1). Since x is a feasible solution to VS, 
then for any i £ V, there exists one and only one partition 
Aj such that Xi j = 1. Our transformation sets x' is — 1 where 

> So v.,-;. 1. 

Second, we prove that the objective values of the two 
feasible solutions are equal. 



max^ x' is c is t = max ^ x L c ist 



.£[*] 



maxy^ x itl Pit 2 



(7) 



(8) 



max y~]xijPik, 



where (|7]) is due to that c; LS t — if s (£. [t], and ^ is due to 
our assignment of x' that x' is = if s ^ s. 

"^=" Given x' for VS-GLB, construct x for VS as follows. 
Set Xij = x'^ where s± = j. We show that x is a feasible 
solution to VS. Due to Lemma [T] for any i, there exists one 
s such that x' is = 1 and s — s. Therefore, there exists one j 
such that x^ = 1, On the other hand, there cannot be two js 
both with Xij = 1, otherwise x' is not a feasible solution to 
VS-GLB. 

For the objective value, we have 



max ) XijPik = max 
3,h ^ t=(j,k) 



^ ' x it 1 Pit 2 



= max^4 c »s^ ( 9 ) 
where (|9| is due to Lemma [2] This completes our proof. 

■ 

C. Inapproximability for GLB 

It has been proved that no polynomial time algorithm can 
give c-approximation solution to VS for any c > 1 unless 
NP = ZPP Combining this result with Theorem [2j we 
have the following theorem. 

Theorem 3. For any constant c > 1, there does not exist a 
polynomial time c-approximation algorithm for GLB, unless 
NP = ZPP. 

Proof: Since VS-GLB is a special instance of GLB, any 
c-approximation algorithm for GLB can be used to obtain c- 
approximation solution to VS-GLB. By Theorem [2j any c- 
approximation solution to VS-GLB is also a c-approximation 
solution to the corresponding VS. Therefore, the approxima- 
tion algorithm is also a c-approximation algorithm for VS, a 
contradiction. ■ 



We can obtain a stronger result by relaxing the assumption 
NP ^ ZPP to F / NP. (It is a relaxation because 
P C ZPP C NP.) This can be done by examining the 
inapproximability proof for VS (TJ. The inapproximability 
proof relies on the result that no polynomial time algorithm 
can approximate chromatic number to within n 1 ~ e for any 
e > unless NP — ZPP. Recently, it has been proved 
that no polynomial time algorithm can approximate chromatic 
number to within n 1 ^^ for any e > unless P = NP (6). 
Thus, we can change the assumption NP ^ ZPP to P ^ NP 
safely. 

Theorem 4. For any constant c > 1, there does not exist a 
polynomial time c- approximation algorithm for GLB, unless 
P = NP. 

D. Extending to generalized VS 

Our construction of VS-GLB and proof can be easily ex- 
tended to a general version of vector scheduling. In the current 
VS definition, all machines (partitions) are identical so that 
any job (vector) incurs the same vector cost to all machines. 
The machines can be generalized to be heterogeneous so that 
each job incurs a different vector cost to different machines. 
Formally, job i incurs vector cost pr" to machine j if i is 
assigned to machine j. The formulation and transformations 
can be slightly changed as follows. In the integer program 
formulation of VS, change the objective to max,,-/. J^. x^p % ' . 

Change pa 2 in equation dip to be pf^ . For Lemma 2, 

(t ) — 2 
change x'^pn 2 to x„p^'. It can be verified that the proof of 

Theorem |2] is still valid with minor modifications. The online 

algorithm adopted later is also valid for this general version 

of vector scheduling. For simplicity, we mainly focus on the 

original VS model. 

III. Online Algorithm for VS 

Based on Theorem[2] we can solve VS by its corresponding 
VS-GLB. We review the approximation algorithm |3| for 
GLB, and then modify it to solve VS. 

Given a GLB instance and a positive number r, the algo- 
rithm [3 1 considers jobs one by one (in an arbitrary order) and 
assigns the current job to a machine to minimize the L T norrrj^] 
of the resulting load of all machines. Specifically, suppose jobs 
are numbered as 1,2, ... ,n, the same as the considered order. 
Suppose the load of machine k after jobs 1, 2, . . . , i — 1 are 
assigned is II . Then job i is assigned to the machine 

arg min ^ (Z]f 1 + c ijk ) T ^ 

The above optimization problem can be solved by trying each 
possible machine. During the optimization, the computation of 
the last step of L T norm, (•) 1//r , can be omitted. In addition, 
because the algorithm does not require the order of jobs 
and each job is assigned once, it can be implemented in 
an online fashion. This algorithm was originally proposed 

2 L T norm of a vector x = (xi, X2, ■ ■ ■ , xt) is defined as fyv sT) 1 / 7 ". 



for the traditional load balancing problem (5), and recently 
extended to the GLB problem 0. The parameter r controls 
the approximation ratio of the algorithm, as shown in the 
following lemma. 

Lemma 3 (Q, Q). Minimizing L T norm gives j^^l 1 ^ 
approximation ratio to solve GLB where I is the number of 
machines. 

Setting r = In I yields the best approximation ratio e log I. 
However, it is unclear whether the computation of L\ n i can 
be done in polynomial time. We consider this issue later. 

A. Adapting to VS 

To apply the above algorithm to VS, we can first solve 
VS-GLB and transform the solution to VS. This process can 
be simplified by omitting the transformation between VS and 
VS-GLB. 

Recall that the algorithm is to assign vectors one by one. 
Consider a vector pi in VS. To solve VS-GLB, this vector 
should choose a machine to minimize the L T norm of the 
resulting load. Due to the construction of VS-GLB, this 
vector can only choose from the anchor machines, otherwise, 
the resulting L T norm would be infinite (definitely not the 
optimal choice). Thus, this is equivalent to picking from the 
corresponding partitions in VS. After the assignment of any 
number of vectors that leads to partitions A 1} A 2 , . . . , A m , the 
L T norm of the load of machines in VS-GLB is, in fact, equal 
to 

f^\A 1 ,...,A m ) = (\\A4l + ... + \\A m \\l) 1/T 
where r 

k \ieAj J 

Suppose the assignment of vectors p\,pi, . ■ ■ ,Pi-\ leads to 
partitions A\, A2, . . . , A m . Let fj^J be L T norm of the result- 
ing load if vector pi chooses partition Aj, i.e., 

= f^(A 1 ,...,A j U{ Pi },...,A m ). 

Then, according to the algorithm, vector pt should be assigned 
to the partition 

• At) 

arg mm / > A 

j J 

The procedure is described in Algorithm [T] For each incom- 
ing vector, it only needs to execute Lines 5-9. 

Algorithm [T| with r = ln(mrf) is an e\og(md) approxima- 
tion algorithm to solve the corresponding VS-GLB. Thus, we 
have the following result due to Theorem [2] 

Lemma 4. Algorithm^ with r = ln(mc?) is an elog(md) 
approximation algorithm to solve VS. 

However, it is unclear whether Algorithm [T] with r = 
ln(md) can terminate within polynomial time. The algorithm 
requires the computation of x ln ( mc f or some x. First, the 
number ln(md) is irrational, thus cannot be represented by 



Algorithm 1: Vector Scheduling 



Input: m, the number of partitions; d, the dimension of 
each vector; px,P2> ■ - ■ ,Pn, the n vectors to be 
scheduled; r, the norm 
Output: Ax,..., A m , the m partitions 
begin 

for j from 1 to m do 

L A i <— 0; 

for i from 1 to n do 
if 3Aj,^ = then 



Ai 



else 



find j to minimize / 



(r). 



A,- 



polynomial bits to achieve arbitrary resolution. Second, even 
though we can approximate it by a rational number with 
acceptable resolution, the number x T may still be irrational, 
where f is the rational approximation to r. For example, when 
f = 1.5, there are lots of values of x such that a; 15 are 
irrational. Though we can still approximate it by a rational 
number, it is complicated to theoretically analyze whether 
the approximation ratio still holds and how the running time 
increases with respect to rational number approximation accu- 
racy. This problem has not been addressed in literature. 

Our solution is to round ln(md) to the next integer 
[ln(r7id)] and compute the Lpi„( m d)] norm. This guarantees 
polynomial running time, but causes the loss of approximation 
ratio. We show in the following that the loss is very small. 

B. Guaranteeing Polynomial Running Time 

To deal with the irrational number issue, we round ln(md) 
to the next integer [ln(mc?)] . In the following, we analyze the 
resulting approximation ratio. 



Theorem 5. Let I be the number of machines. Minimizing 
Lrjnj] norm gives elog(Z) + ^"fff 
solve GLB. 



approximation ratio to 



Proof: This result is obtained from Lemma [3] by per- 

l 1 ^. Consider the 



forming calculus analysis. Let g(x 
derivative of g, 

ln~2 



ln(2) 



/(.*) = 



1 - 



In/ 



For x > In I, it holds that g'(x) > so that the function g(x) is 
monotonically increasing. Since ln(Z) < |~ln(Z)] ) < ln(Z) + 1, 
we have 

g(\ln(l)])-g(lnl)<g(]nl + l)-g(lnl). 

In addition, consider the two points (InZ, g(lnZ)) and (InZ + 
l,g(lnZ + 1)). Due to Langrange's mean value theorem in 
calculus, there exists £ g [InZ, InZ + 1] such that 

.g(lnZ + l)- 3 (lnZ) = </(£). 



Since £ > InZ, we have I 1 /* < e. Additionally, £ < InZ + l, 



so 1 - 

elog(e) 
InZ+l 



lnj 



< 



InZ+l 

We have 



Therefore, 



9'(0 < R2) 



1 - 



lnj 



< 



g(\lnl])-g(lnl)<g(lnl + l)-g(lnl) < 



elog(e) 



InZ + 1 

Note that g(lnZ) = elog(Z). This completes our proof. ■ 
This theorem holds for general GLB problem, such as the 
one considered in [3 | [5| and |4]. Of course, it holds for VS- 
GLB as well. To have an intuition on the loss, we plot the two 
approximation ratios with respect to the number of machines 
in Figure [T] We can see that the loss is small. 
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Fig. 1. Approximation ratio loss due to rounding. Before rounding, the 
approximation ratio is y = e log(x) and it becomes y = e log(x) + ^'"^^ 
after rounding. 

We have the following corollary due to Theorem [5] 

Corollary 1. With r = [ln(meZ)"|, Algorithm [7] is an 
elog(mcZ) + approximation algorithm to VS, and it 

runs in polynomial time. 

The polynomial running time can be shown by the following 
analysis. We assume r = [ln(mcZ)] if not specified. The main 
time consuming step is to minimize f> r ) over j for given i. 
We can omit the computation of the outer 1/r power since 
function x v is monotonic for x > and y > 0. In computing 
L T norm, there is a basic operation, the integer power of a 
number, a T , where a is an element in any vector Aj. The 
naive approach, which multiplies a iteratively, involves r — 
1 multiplications. This can be improved by utilizing partial 
multiplication results. For example, computing a 8 as ((a 2 ) 2 ) 2 
only needs 3 multiplications. Generally, computing a T requires 
Llog(r)J +v(t) — 1 multiplications, where v(t) is the number 
of Is in the binary representation of r (Chapter 4.6.3 in Q). 
In the following, we put an upper bound 2 log r to the number 
of multiplications needed to compute a T . 

To compute />V for given i and j, it needs d + m — 1 
additions (adding pi to Aj, suppose Aj is maintained in each 
iteration) and 2mdlog(r) multiplications (md numbers, each 
needs to compute its r power). To find the optimal j for given 
i, we need to compute f£) for all j, and select the optimal 
one by comparison. This procedure needs m(d + m — 1) 



additions, 2d\og(T)m 2 multiplications, and m — 1 compar- 
isons. In summary, it takes 0(dlog(r)m 2 ) time for one vector. 
For the overall algorithm, it takes 0(dlog(r)nm 2 ) time. 
The computations can be sped up by exploiting the problem 
structure. The complexity can be reduced to 0(dlog(r)mn), 
dropping one m factor, as shown in the following. 

C. Computation Speedup 

Towards VS-GLB, we have the following lemma. Note that 
this lemma does not hold for the general GLB problem. 



Lemma 5. For any ji,j2, it holds that fj^ > f^ T J if and 
only if 



r(r) 



A h U {p l } 



- A 



Ji I 



> 



A 



J '2 I 



A h U { Pi \ 

1 1 A m 1 1 T T to both sides proves 



Proof: Adding ||Ai||; + . . 
the lemma. ■ 
Algorithm [2] shows the final design. For each partition Aj, 
the algorithm maintains two variables, the vector Aj (/ij in 
the algorithm) and its norm ||Aj||£ (5j in the algorithm). If 
there is no empty partition, then each incoming vector searches 

over all partitions to find the j to minimize 



Aj U { Pi \ 

Aj^ T (Lines 12-24). As Lemma jH] shows, this is equivalent 
to minimize /Jy . 

For the running time, consider a new vector that cannot 
find an empty partition. There are md additions (Lines 13,16), 
2mrflog(r) multiplications (Lines 14,17), 2(m — 1) subtrac- 
tions and to — 1 comparisons (Line 18). The dominating factor 
is mdlog(r). This is for one vector. For all n vectors, the 
running time is 0(mndlog(r)), compared to 0(m 2 nd log r) 
before speedup. Substituting r = [In (rod)] into the formula 
yields 0(nmd In ln(md)) running time, polynomial in the 
input length (note m < n). This analysis, together with 
Corollary [T] and Lemma [5] gives the following theorem. 

Theorem 6. Algorithmy. is an e\og(md) + approx- 
imation algorithm to VST It runs in 0(nmd\n\n(md)) time. 

It should be noted that we treat multiplications as basic 
operations in the above running time analysis. The running 
time will be different if we further consider the complexity 
of computing multiplications. Multiplying two n-bit integers 
takes time 0(n 159 ) for a recursive algorithm (Chapter 5.5 
in JH). Applying such analysis to Algorithm |2j however, 
requires the consideration of the length of the binary repre- 
sentation of each numeric value in the vectors, which may 
be complicated. Nevertheless, it is clear that multiplications 
run in polynomial time in the input length. Thus Algorithm [2] 
terminates certainly in polynomial time. 

IV. Conclusion 

In this work, we connect the vector scheduling problem with 
the generalized load balancing problem, and obtain new results 
by applying existing results to each other. On one hand, we 
show that generalized load balancing does not admit constant 
approximation algorithms unless P — NP. On the other hand, 
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rin(mrf)] 



Sped-up Vector Scheduling with t = 



Input: to, the number of partitions; d, the dimension of 
each vector; P i, P 2, ■ ■ ■ ,Pn, the n vectors to be 
scheduled 
Output: Ai, . . . , A m , the m partitions 
i begin 

for j from 1 to to do 
Ah 4— 0; 
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0; 

; 



// vector Aj 
II scalar ||X-||I 



for i from 1 to n do 



if 3Aj , Aj 
A, <- 



= then 

Aj Ufe}; 
\\Pi\\l\ 



else 



Jmin 



// partition index 



1; 

Mi +Pl> 

II Mmin || r ' 

for j from 2 to m do 

fi < — Hj+pf, II vector addition 

8<—\\m: 

if ^min - Sj miD > S 
Jmin ^ 3 ■> 

^min ^ &i 



Sj then 



- ,1. . 



we design a polynomial time algorithm for vector scheduling 
by using an existing algorithm for GLB and modifying it to 
guarantee polynomial running time. The resulting algorithm 
has two advantages. It can be performed in an online fashion, 
and it provides better approximation bound to solve VS than 
existing offline polynomial time algorithm. Note that we round 
ln(mef) to an integer to guarantee polynomial running time, 
which causes a small approximation ratio loss. It is unclear 
whether computing L ln ^ nd ^ norm directly, without rounding, 
can be done in polynomial time. 
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